Quick look at Dr. Adeleke’s 2022 Study data

Erin Stearns

2022-09-14

Quick glance

Dr. Adeleke’s study data can be found on the GDx here.

Summary of data by state

State Sites
Abia 26
Adamawa 30
Akwa Ibom 26
Anambra 13
Bauchi 20
Bayelsa 12
Benue 12
Borno 4
Cross River 18
Delta 23
Ebonyi 28
Edo 22
Ekiti 30
Enugu 17
FCT 20
Gombe 19
Imo 27
Jigawa 19
Kaduna 24
Kano 23
Katsina 15
Kebbi 8
Kogi 16
Kwara 13
Lagos 3
Nasarawa 10
Niger 14
Ogun 22
Ondo 23
Osun 32
Oyo 21
Plateau 19
Rivers 20
Sokoto 14
Taraba 15
Yobe 14
Zamfara 22

Data processing

Starting dataset dimensions: 815 rows, 25 columns

  1. Remove ‘test’ rows
    • username = test, training_user, or state_1
    • Data dimensions after removing ‘test’ rows: 714 rows
  2. Prepare community coordinate data
    • Currently a string of 4 values (lat, long, elevation, & precision), so split that into 4 individual fields
    • Transform fields to numeric fields (formerly character)
    • Remove rows missing lat/long for communities (20 rows)
    • Data dimensions after removing missing coordinates: 694 rows

Questions for Dr. Adeleke

  • There are 2 fields with coordinates: r_GPSCommunity & r_GPS_river_basin - were the coordinates from the r_GPSCommunity from the actual fly collection site or are these village centroids?

  • Why are some rows missing community coordinates?

Questions for Dimagi

  • Could Dimagi please filter out ‘test’ rows prior to posting data?
    • Is there a consistent flag depicting a ‘test row’?